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(F) ZIP : 94104 
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(ix) TELECOMMUNICATION INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 
CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 
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GTGCAGGCGG 


TCTACGGCTT 


CGCCAAGAGC 


CTCCTCAAGG 


CCCTCAAGGA 


GGACGGGGAC 


180 




GCGGTGATCG 


TGGTCTTTGA 


CGCCAAGGCC 


CCCTCCTTCC 


GCCACGAGGC 


CTACGGGGGG 


240 




TACAAGGCGG 


GCCGGGCCCC 


CACGCCGGAG 


GACTTTCCCC 


GGCAACTCGC 


CCTCATCAAG 


300 




GAGCTGGTGG 


ACCTCCTGGG 


GCTGGCGCGC 


CTCGAGGTCC 


CGGGCTACGA 


GGCGGACGAC 


360 




GTCCTGGCCA 


GCCTGGCCAA 


GAAGGCGGAA 


AAGGAGGGCT 


ACGAGGTCCG 


CATCCTCACC 


420 




GCCGACAAAG 


ACCTTTACCA 


GCTCCTTTCC 


GACCGCATCC 


ACGTCCTCCA 


CCCCGAGGGG 


480 




TACCTCATCA 


CCCCGGCCTG 


GCTTTGGGAA 


AAGTACGGCC 


TGAGGCCCGA 


CCAGTGGGCC 


540 




GACTACCGGG 


CCCTGACCGG 


GGACGAGTCC 


GACAACCTTC 


CCGGGGTCAA 


GGGCATCGGG 


600 




GAGAAGACGG 


CGAGGAAGCT 


TCTGGAGGAG 


TGGGGGAGCC 


TGGAAGCCCT 


CCTCAAGAAC 


660 




CTGGACCGGC 


TGAAGCCCGC 


CATCCGGGAG 


AAGATCCTGG 


CCCACATGGA 


CGATCTGAAG 


720 




CTCTCCTGGG 


ACCTGGCCAA 


GGTGCGCACC 


GACCTGCCCC 


TGGAGGTGGA 


CTTCGCCAAA 


780 


P 

pat 

P 


AGGCGGGAGC 


CCGACCGGGA 


GAGGCTTAGG 


GCCTTTCTGG 


AGAGGCTTGA 


GTTTGGCAGC 


840 


Sj 


CTCCTCCACG 


AGTTCGGCCT 


TCTGGAAAGC 


CCCAAGGCCC 


TGGAGGAGGC 


CCCCTGGCCC 


900 


gfys 

Ul 


CCGCCGGAAG 


GGGCCTTCGT 


GGGCTTTGTG 


CTTTCCCGCA 


AGGAGCCCAT 


GTGGGCCGAT 


960 


ni 
Ki 

E 


CTTCTGGCCC 


TGGCCGCCGC 


CAGGGGGGGC 


CGGGTCCACC 


GGGCCCCCGA 


GCCTTATAAA 


1020 


GCCCTCAGGG 


ACCTGAAGGA 


GGCGCGGGGG 


CTTCTCGCCA AAGACCTGAG 


CGTTCTGGCC 


1080 


F= : 

ru 


CTGAGGGAAG 


GCCTTGGCCT 


CCCGCCCGGC 


GACGACCCCA 


TGCTCCTCGC 


CTACCTCCTG 


1140 




GACCCTTCCA 


ACACCACCCC 


CGAGGGGGTG 


GCCCGGCGCT 


ACGGCGGGGA 


GTGGACGGAG 


1200 


ru 


GAGGCGGGGG 


AGCGGGCCGC 


CCTTTCCGAG 


AGGCTCTTCG 


CCAACCTGTG 


GGGGAGGCTT 


1260 


ft) 


GAGGGGGAGG 


AGAGGCTCCT 


TTGGCTTTAC 


CGGGAGGTGG 


AGAGGCCCCT 


TTCCGCTGTC 


1320 




CTGGCCCACA 


TGGAGGCCAC 


GGGGGTGCGC 


CTGGAGGTGG 


CCTATCTCAG 


GGCCTTGTCC 


1380 




CTGGAGGTGG 


CCGAGGAGAT 


CGCCCGCCTC 


GAGGCCGAGG 


TCTTCCGCCT 


GGCCGGCCAC 


1440 




CCCTTCAACC 


TCAACTCCCG 


GGACCAGCTG 


GAAAGGGTCC 


TCTTTGACGA 


GCTAGGGCTT 


1500 




CCCGCCATCG 


GCAAGACGGA 


GAAGACCGGC 


AAGCGCTCCA 


CCAGCGCCGC 


CGTCCTGGAG 


1560 




GCCCTCCGCG 


AGGCCCACCC 


CATCGTGGAG 


AAGATCCTGC 


AGTACCGGGA 


GCTCACCAAG 


1620 




CTGAAGAGCA 


CCTACATTGA 


CCCCTTGCCG 


GACCTCATCC 


ACCCCAGGAC 


GGGCCGCCTC 


1680 








GGCCACGGCC 


ACGGGCAGGC 


TAAGTAGCTC 


CGATCCCAAC 


1740 




CTCCAGAACA 


TCCCCGTCCG 


CACCCCGCTT 


GGGCAGAGGA 


TCCGCCGGGC 


CTTCATCGCC 


1800 




GAGGAGGGGT 


GGCTATTGGT 


GGCCCTGGAC 


TATAGCCAGA 


TAGAGCTCAG 


GGTGCTGGCC 


1860 




CACCTCTCCG 


GCGACGAGAA 


CCTGATCCGG 


GTCTTCCAGG 


AGGGGCGGGA 


CATCCACACG 


1920 




GAGACCGCCA 


. GCTGGATGTT 


CGGCGTCCCC 


CGGGAGGCCG 


TGGACCCCCT 


GATGCGCCGG 


1980 
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GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 204 0 

GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 2100 

CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG GGGGTACGTG 2160 

GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT GAAGAGCGTG 2220 

CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC CGCCGACCTC 2280 

ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC CAGGATGCTC 234 0 

CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC 2400 

CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG CCGTGCCCCT GGAGGTGGAG 2460 

GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 2 506 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2496 base pairs 
fc»l (B) TYPE: nucleic acid 

□ (C) STRANDEDNESS : double 

\\ (D) TOPOLOGY: linear 



N 5 



■F 
Ui 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



ni 
m 



ATGGCGATGC 


TTCCCCTCTT 


TGAGCCCAAA 


GGCCGCGTGC 


TCCTGGTGGA 


CGGCCACCAC 


60 


CTGGCCTACC 


GCACCTTCTT 


TGCCCTCAAG 


GGCCTCACCA 


CCAGCCGCGG 


CGAACCCGTT 


120 


CAGGCGGTCT 


ACGGCTTCGC 


CAAAAGCCTC 


CTCAAGGCCC 


TGAAGGAGGA 


CGGGGACGTG 


180 


GTGGTGGTGG 


TCTTTGACGC 


CAAGGCCCCC 


TCCTTCCGCC 


ACGAGGCCTA 


CGAGGCCTAC 


240 


AAGGCGGGCC 


GGGCCCCCAC 


CCCGGAGGAC 


TTTCCCCGGC 


AGCTGGCCCT 


CATCAAGGAG 


300 


TTGGTGGACC 


TCCTAGGCCT 


TGTGCGGCTG 


GAGGTTCCCG 


GCTTTGAGGC 


GGACGACGTG 


360 


CTGGCCACCC 


TGGCCAAGCG 


GGCGGAAAAG 


GAGGGGTACG 


AGGTGCGCAT 


CCTCACTGCC 


420 


GACCGCGACC 


TCTACCAGCT 


CCTTTCGGAG 


CGCATCGCCA 


TCCTCCACCC 


TGAGGGGTAC 


480 


CTGATCACCC 


CGGCGTGGCT 


TTACGAGAAG 


TACGGCCTGC 


GCCCGGAGCA 


GTGGGTGGAC 


540 


TACCGGGCCC 


TGGCGGGGGA 


CCCCTCGGAT 


AACATCCCCG 


GGGTGAAGGG 


CATCGGGGAG 


600 


AAGACCGCCC 


AGAGGCTCAT 


CCGCGAGTGG 


GGGAGCCTGG 


AAAACCTCTT 


CCAGCACCTG 


660 


GACCAGGTGA 


AGCCCTCCTT 


GCGGGAGAAG 


CTCCAGGCGG 


GCATGGAGGC 


CCTGGCCCTT 


720 


TCCCGGAAGC 


TTTCCCAGGT 


GCACACTGAC 


CTGCCCCTGG 


AGGTGGACTT 


CGGGAGGCGC 


780 


CGCACACCCA 


ACCTGGAGGG 


TCTGCGGGCT 


TTTTTGGAGC 


GGTTGGAGTT 


TGGAAGCCTC 


840 


CTCCACGAGT 


TCGGCCTCCT 


GGAGGGGCCG 


AAGGCGGCAG 


AGGAGGCCCC 


CTGGCCCCCT 


900 


CCGGAAGGGG 


CTTTTTTGGG 


CTTTTCCTTT 


TCCCGTCCCG 


AGCCCATGTG 


GGCCGAGCTT 


960 
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# 



CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC CCTTAGGGGC 1020 

CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT TTTGGCCCTG 1080 

CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA CCTTCTGGAC 1140 

CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 1200 

GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 1260 

GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 1320 

GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 1380 

GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 144 0 

TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 1500 

GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 1560 

yi CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT CACCAAGCTC 1620 

JSj AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC 1680 

\\ ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 1740 

£ 

i". CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 1800 

W 

W GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 1860 

m 

s CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 192 0 

£"j ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 1980 

i V 4 

\& GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 2040 

ft) 

CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 2100 

fU AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 2160 

ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 2220 

GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 2280 

AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 2340 

CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 24 00 

TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 2460 

GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 24 96 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 





ATGGAGGCGA 


TGCTTCCGCT 


CTTTGAACCC 


AAAGGCCGGG 


TCCTCCTGGT 


GGACGGCCAC 


60 




CACCTGGCCT 


ACCGCACCTT 


CTTCGCCCTG 


AAGGGCCTCA 


CCACGAGCCG 


GGGCGAACCG 


120 




GTGCAGGCGG 


TCTACGGCTT 


CGCCAAGAGC 


CTCCTCAAGG 


CCCTGAAGGA 


GGACGGGTAC 


180 




AAGGCCGTCT 


TCGTGGTCTT 


TGACGCCAAG 


GCCCCCTCCT 


TCCGCCACGA 


GGCCTACGAG 


240 




GCCTACAAGG 


CGGGGAGGGC 


CCCGACCCCC 


GAGGACTTCC 


CCCGGCAGCT 


CGCCCTCATC 


300 




AAGGAGCTGG 


TGGACCTCCT 


GGGGTTTACC 


CGCCTCGAGG 


TCCCCGGCTA 


CGAGGCGGAC 


360 




GACGTTCTCG 


CCACCCTGGC 


CAAGAAGGCG 


GAAAAGGAGG 


GGTACGAGGT 


GCGCATCCTC 


420 




ACCGCCGACC 


GCGACCTCTA 


CCAACTCGTC 


TCCGACCGCG 


TCGCCGTCCT 


CCACCCCGAG 


480 




GGCCACCTCA 


TCACCCCGGA 


GTGGCTTTGG 


GAGAAGTACG 


GCCTCAGGCC 


GGAGCAGTGG 


540 




GTGGACTTCC 


GCGCCCTCGT 


GGGGGACCCC 


TCCGACAACC 


TCCCCGGGGT 


CAAGGGCATC 


600 




GGGGAGAAGA 


CCGCCCTCAA 


GCTCCTCAAG 


GAGTGGGGAA 


GCCTGGAAAA 


CCTCCTCAAG 


660 


SI 


AACCTGGACC 


GGGTAAAGCC 


AGAAAACGTC 


CGGGAGAAGA 


TCAAGGCCCA 


CCTGGAAGAC 


720 


is! 


CTCAGGCTCT 


CCTTGGAGCT 


CTCCCGGGTG 


CGCACCGACC 


TCCCCCTGGA 


GGTGGACCTC 


780 




GCCCAGGGGC 


GGGAGCCCGA 


CCGGGAGGGG 


CTTAGGGCCT 


TCCTGGAGAG 


GCTGGAGTTC 


840 




GGCAGCCTCC 


TCCACGAGTT 


CGGCCTCCTG 


GAGGCCCCCG 


CCCCCCTGGA 


GGAGGCCCCC 


900 


0 
fy 


TGGCCCCCGC 


CGGAAGGGGC 


CTTCGTGGGC 


TTCGTCCTCT 


CCCGCCCCGA 


GCCCATGTGG 


960 


M 


GCGGAGCTTA 


AAGCCCTGGC CGCCTGCAGG 


GACGGCCGGG 


TGCACCGGGC 


AGCAGACCCC 


1020 


ni 


TTGGCGGGGC 


TAAAGGACCT 


CAAGGAGGTC 


CGGGGCCTCC 


TCGCCAAGGA 


CCTCGCCGTC 


1080 


ru 


TTGGCCTCGA 


GGGAGGGGCT 


AGACCTCGTG 


CCCGGGGACG 


ACCCCATGCT 


CCTCGCCTAC 


1140 




CTCCTGGACC 


CCTCCAACAC 


CACCCCCGAG 


GGGGTGGCGC 


GGCGCTACGG 


GGGGGAGTGG 


1200 




ACGGAGGACG 


CCGCCCACCG 


GGCCCTCCTC 


TCGGAGAGGC 


TCCATCGGAA 


CCTCCTTAAG 


1260 




CGCCTCGAGG 


GGGAGGAGAA 


GCTCCTTTGG 


CTCTACCACG 


AGGTGGAAAA 


GCCCCTCTCC 


1320 




CGGGTCCTGG 


CCCACATGGA 


GGCCACCGGG 


GTACGGCTGG 


ACGTGGCCTA 


CCTTCAGGCC 


1380 




CTTTCCCTGG 


AGCTTGCGGA 


GGAGATCCGC 


CGCCTCGAGG 


AGGAGGTCTT 


CCGCTTGGCG 


1440 




GGCCACCCCT 


TCAACCTCAA 


CTCCCGGGAC 


C AGC TCjCj AAA 


/~»^**^irp/~i/ nn/"i« ■ »J ■ > 
CjQj(j I KjL. 1 U 1 1 




X D \J \J 




AGGCTTCCCG 


CCTTGGGGAA 


GACGCAAAAG 


ACAGGCAAGC 


GCTCCACCAG 


CGCCGCGGTG 


1560 




CTGGAGGCCC 


TACGGGAGGC 


CCACCCCATC 


GTGGAGAAGA 


TCCTCCAGCA 


CCGGGAGCTC 


1620 




ACCAAGCTCA 


AGAACACCTA 


CGTGGACCCC 


CTCCCAAGCC 


TCGTCCACCC 


GAGGACGGGC 


1680 




CGCCTCCACA 


CCCGCTTCAA 


CCAGACGGCC 


ACGGCCACGG 


GGAGGCTTAG 


TAGCTCCGAC 


1740 




CCCAACCTGC 


AGAACATCCC 


CGTCCGCACC 


CCCTTGGGCC 


AGAGGATCCG 


CCGGGCCTTC 


1800 
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GTGGCCGAGG 


CGGGTTGGGC 


GTTGGTGGCC 


CTGGACTATA 


GCCAGATAGA 


GCTCCGCGTC 


1860 


CTCGCCCACC 


TCTCCGGGGA 


CGAAAACCTG 


ATCAGGGTCT 


TCCAGGAGGG 


GAAGGACATC 


1920 


CACACCCAGA 


CCGCAAGCTG 


GATGTTCGGC 


GTCCCCCCGG 


AGGCCGTGGA 


CCCCCTGATG 


1980 


CGCCGGGCGG 


CCAAGACGGT 


GAACTTCGGC 


GTCCTCTACG 


GCATGTCCGC 


CCATAGGCTC 


2040 


TCCCAGGAGC 


TTGCCATCCC 


CTACGAGGAG 


GCGGTGGCCT 


TTATAGAGGC 


TACTTCCAAA 


2100 


GCTTCCCCAA 


GGTGCGGGCC 


TGGATAGAAA 


AGACCCTGGA 


GGAGGGGAGG 


AAGCGGGGCT 


2160 


ACGTGGAAAC 


CCTCTTCGGA 


AGAAGGCGCT 


ACGTGCCCGA 


CCTCAACGCC 


CGGGTGAAGA 


2220 


GCGTCAGGGA 






TPAAPATGPP 




f\\~ \~ vj V- v_ Li L. L. Vj 


Toon 


ACCTCATGAA 


GCTCGCCATG 


GTGAAGCTCT 


TCCCCCGCCT 


CCGGGAGATG 


GGGGCCCGCA 


2340 


TGCTCCTCCA 


GGTCCACGAC 


GAGCTCCTCC 


TGGAGGCCCC 


CCAAGCGCGG 


GCCGAGGAGG 


2400 


TGGCGGCTTT 


GGCCAAGGAG 


GCCATGGAGA 


AGGCCTATCC 


CCTCGCCGTG 


CCCCTGGAGG 


2460 


TGGAGGTGGG 


GATGGGGGAG 


GACTGGCTTT 


CCGCCAAGGG 


TTAG 




2504 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp 
130 135 140 
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Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu 
145 150 ~ 155 



His Pro Glu Gly 
160 



Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr 
165 " 170 



Gly Leu Arg Pro 
175 



Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp 
180 18 5 



Glu Ser Asp Asn 
190 



Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala 
195 200 



Arg Lys Leu Leu 
205 



Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 215 220 



Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met 
225 230 235 



Asp Asp Leu Lys 
240 



Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu 
245 * 250 

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg 
260 265 



Pro Leu Glu Val 
255 

Leu Arg Ala Phe 

270 



M 

UJ 



Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 



Phe Gly Leu Leu 
285 



Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 



hi 
m 

Q 

R J 

IV 



Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro 
305 310 ~ 315 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val 
325 330 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 



Met Trp Ala Asp 
320 

His Arg Ala Pro 
335 

Arg Gly Leu Leu 
350 

Leu Gly Leu Pro 
365 



Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 



Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 ~ ' 395 



Glu Trp Thr Glu 
400 



Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 



Phe Ala Asn Leu 
415 



Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 



Leu Tyr Arg Glu 
430 



Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 



Glu Ala Thr Gly 
445 



Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 



Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 



Leu Ala Gly His 
480 
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Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu 
545 ~ 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val Ala 
0 595 600 605 

□ 

SJ Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 

610 615 620 

W Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His Thr 

fll 625 630 635 640 

Si 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
L 645 650 655 



fy Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

m 



m 



660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
72 5 ~ 73 0 73 5 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 ~ 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 
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Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 * ~ 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ala lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Gin Arg Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 
210 215 220 

Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 240 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 
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Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
275 280 285 

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gin Asp 
325 330 335 

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly lie Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro 
355 360 365 

g Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 

jS{ 370 375 380 

S| Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 

*p 385 390 395 " 400 

Li! 

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gin Thr Leu Lys 
405 410 415 



m 

09 



Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val 



h= 435 440 445 

ru 

f»l Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Glu Ala 

™ 450 455 460 

fU 



Glu Val Arg Gin Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 ~ 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser 
500 505 * 510 

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val 
515 520 525 

Asp Arg lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His 
545 550 555 * 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 
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He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu 
595 600 605 

Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met 
660 * 665 670 

Ser Ala His Arg Leu Ser Gly Glu Leu Ser He Pro Tyr Glu Glu Ala 
675 680 685 

Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg Ala 
690 695 700 

Trp He Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu 
705 710 715 720 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 
725 ~ 730 735 

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe 
755 760 765 

Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His Asp 
770 775 780 

Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala 
785 790 795 800 

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gin Val Pro Leu 
805 810 815 

Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 
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Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 ' 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg 
130 135 " 140 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 160 

Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 * 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 
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Leu Val Pro Gly Asp 
370 

Ser Asn Thr Thr Pro 
385 

Thr Glu Asp Ala Ala 
405 

Asn Leu Leu Lys Arg 
420 

His Glu Val Glu Lys 
435 

Thr Gly Val Arg Leu 
450 

Leu Ala Glu Glu lie 
465 

Gly His Pro Phe Asn 
485 



Phe Asp Glu Leu Arg 
500 

Lys Arg Ser Thr Ser 
515 

Pro lie Val Glu Lys 
530 

Asn Thr Tyr Val Asp 
545 

Arg Leu His Thr Arg 
565 

Ser Ser Ser Asp Pro 
580 

Gly Gin Arg lie Arg 
595 



Val Ala Leu Asp Tyr 
610 

Ser Gly Asp Glu Asn 
625 

His Thr Gin Thr Ala 
645 

Asp Pro Leu Met Arg 
660 

Tyr Gly Met Ser Ala 
675 

Glu Glu Ala Val Ala 
690 



Asp Pro Met Leu Leu Ala 
375 

Glu Gly Val Ala Arg Arg 
390 395 

His Arg Ala Leu Leu Ser 
410 

Leu Glu Gly Glu Glu Lys 
425 

Pro Leu Ser Arg Val Leu 
440 

Asp Val Ala Tyr Leu Gin 
455 

Arg Arg Leu Glu Glu Glu 
470 475 

Leu Asn Ser Arg Asp Gin 
490 

Leu Pro Ala Leu Gly Lys 
505 

Ala Ala Val Leu Glu Ala 
520 

lie Leu Gin His Arg Glu 
535 

Pro Leu Pro Ser Leu Val 
550 555 

Phe Asn Gin Thr Ala Thr 
570 

Asn Leu Gin Asn lie Pro 
585 

Arg Ala Phe Val Ala Glu 
600 

Ser Gin lie Glu Leu Arg 
615 

Leu lie Arg Val Phe Gin 
630 635 

Ser Trp Met Phe Gly Val 
650 

Arg Ala Ala Lys Thr Val 
665 

His Arg Leu Ser Gin Glu 
680 

Phe lie Glu Arg Tyr Phe 
695 



Tyr Leu Leu Asp Pro 
380 

Tyr Gly Gly Glu Trp 
400 

Glu Arg Leu His Arg 
415 

Leu Leu Trp Leu Tyr 
430 

Ala His Met Glu Ala 
445 

Ala Leu Ser Leu Glu 
460 

Val Phe Arg Leu Ala 
480 

Leu Glu Arg Val Leu 
495 

Thr Gin Lys Thr Gly 
510 

Leu Arg Glu Ala His 
525 

Leu Thr Lys Leu Lys 
540 

His Pro Arg Thr Gly 
560 

Ala Thr Gly Arg Leu 
575 

Val Arg Thr Pro Leu 
590 

Ala Gly Trp Ala Leu 
605 

Val Leu Ala His Leu 
620 

Glu Gly Lys Asp lie 
640 

Pro Pro Glu Ala Val 
655 

Asn Phe Gly Val Leu 
670 

Leu Ala lie Pro Tyr 
685 

Gin Ser Phe Pro Lys 
700 
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Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 ' 815 

M 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
U 820 825 830 

□ 

Sj Lys Gly 

f\ (2) INFORMATION FOR SEQ ID NO : 7 : 

w 

fU (i) SEQUENCE CHARACTERISTICS: 

ff; (A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
fcJ (D) TOPOLOGY: linear 



E 



m 

d 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATGNNGGCGA 


TGCTTCCCCT 


CTTTGAGCCC 


AAAGGCCGGG 


TCCTCCTGGT 


GGACGGCCAC 


60 


CACCTGGCCT 


ACCGCACCTT 


CTTCGCCCTG 


AAGGGCCTCA 


CCACCAGCCG 


GGGCGAACCG 


120 


GTGCAGGCGG 


TCTACGGCTT 


CGCCAAGAGC 


CTCCTCAAGG 


CCCTGAAGGA 


GGACGGGGAC 


180 


NNGGCGGTGN 


TCGTGGTCTT 


TGACGCCAAG 


GCCCCCTCCT 


TCCGCCACGA 


GGCCTACGAG 


240 


GCCTACAAGG 


CGGGCCGGGC 


CCCCACCCCG 


GAGGACTTTC 


CCCGGCAGCT 


CGCCCTCATC 


300 


AAGGAGCTGG 


TGGACCTCCT 


GGGGCTTGCG 


CGCCTCGAGG 


TCCCCGGCTA 


CGAGGCGGAC 


360 


GACGTNCTGG 


CCACCCTGGC 


CAAGAAGGCG 


GAAAAGGAGG 


GGTACGAGGT 


GCGCATCCTC 


420 


ACCGCCGACC 


GCGACCTCTA 


CCAGCTCCTT 


TCCGACCGCA 


TCGCCGTCCT 


CCACCCCGAG 


480 


GGGTACCTCA 


TCACCCCGGC 


GTGGCTTTGG 


GAGAAGTACG 


GCCTGAGGCC 


GGAGCAGTGG 


540 


GTGGACTACC 


GGGCCCTGGC 


GGGGGACCCC 


TCCGACAACC 


TCCCCGGGGT 


CAAGGGCATC 


600 


GGGGAGAAGA 


CCGCCCNGAA 


GCTCCTCNAG 


GAGTGGGGGA 


GCCTGGAAAA 


CCTCCTCAAG 


660 


AACCTGGACC 


GGGTGAAGCC 


CGCCNTCCGG 


GAGAAGATCC 


AGGCCCACAT 


GGANGACCTG 


720 
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ANGCTCTCCT 


GGGAGCTNTC 


CCAGGTGCGC 


ACCGACCTGC 


CCCTGGAGGT 


GGACTTCGCC 


780 




AAGNGGCGGG 


AGCCCGACCG 


GGAGGGGCTT 


AGGGCCTTTC 


TGGAGAGGCT 


GGAGTTTGGC 


840 




AGCCTCCTCC 


ACGAGTTCGG 


CCTCCTGGAG 


GGCCCCAAGG 


CCCTGGAGGA 


GGCCCCCTGG 


900 




CCCCCGCCGG 


AAGGGGCCTT 


CGTGGGCTTT 


GTCCTTTCCC 


GCCCCGAGCC 


CATGTGGGCC 


960 




GAGCTTCTGG 


CCCTGGCCGC 


CGCCAGGGAG 


GGCCGGGTCC 


ACCGGGCACC 


AGACCCCTTT 


1020 




ANGGGCCTNA 


GGGACCTNAA 


GGAGGTGCGG 


GGNCTCCTCG 


CCAAGGACCT 


GGCCGTTTTG 


1080 




GCCCTGAGGG 


AGGGCCTNGA 


CCTCNTGCCC 


GGGGACGACC 


CCATGCTCCT 


CGCCTACCTC 


1140 




CTGGACCCCT 


CCAACACCAC 


CCCCGAGGGG 


GTGGCCCGGC 


GCTACGGGGG 


GGAGTGGACG 


1200 




GAGGANGCGG 


GGGAGCGGGC 


CCTCCTNTCC 


GAGAGGCTCT 


TCCNGAACCT 


NNNGCAGCGC 


1260 




CTTGAGGGGG 


AGGAGAGGCT 


CCTTTGGCTT 


TACCAGGAGG 


TGGAGAAGCC 


CCTTTCCCGG 


1320 


M 


GTCCTGGCCC 


ACATGGAGGC 


CACGGGGGTN 


CGGCTGGACG 


TGGCCTACCT 


CCAGGCCCTN 


1380 


.£351. 

D 


TCCCTGGAGG 


TGGCGGAGGA 


GATCCGCCGC 


CTCGAGGAGG 


AGGTCTTCCG 


CCTGGCCGGC 


1440 


M 


CACCCCTTCA 


ACCTCAACTC 


CCGGGACCAG 


CTGGAAAGGG 


TGCTCTTTGA 


CGAGCTNGGG 


1500 


CTTCCCGCCA .TCGGCAAGAC 


GGAGAAGACN 


GGCAAGCGCT 


CCACCAGCGC 


CGCCGTGCTG 


1560 


ry 
00 


GAGGCCCTNC 


GNGAGGCCCA 


CCCCATCGTG 


GAGAAGATCC 


TGCAGTACCG 


GGAGCTCACC 


1620 




AAGCTCAAGA 


ACACCTACAT 


NGACCCCCTG 


CCNGNCCTCG 


TCCACCCCAG 


GACGGGCCGC 


1680 


o 
ru 


CTCCACACCC 


GCTTCAACCA 


GACGGCCACG 


GCCACGGGCA 


GGCTTAGTAG 


CTCCGACCCC 


1740 




AACCTGCAGA 


ACATCCCCGT 


CCGCACCCCN 


CTGGGCCAGA 


GGATCCGCCG 


GGCCTTCGTG 


1800 


ru 

«« 


GCCGAGGAGG. 


GNTGGGTGTT 


GGTGGCCCTG 


GACTATAGCC 


AGATAGAGCT 


CCGGGTCCTG 


1860 


es : 

!U 


GCCCACCTCT 


CCGGGGACGA 


GAACCTGATC 


CGGGTCTTCC 


AGGAGGGGAG 


GGACATCCAC 


1920 




ACCCAGACCG 


CCAGCTGGAT 


GTTCGGCGTC 


CCCCCGGAGG 


CCGTGGACCC 


CCTGATGCGC 


1980 




CGGGCGGCCA 


AGACCATCAA 


CTTCGGGGTC 


CTCTACGGCA 


TGTCCGCCCA 


CCGCCTCTCC 


2040 




CAGGAGCTTG 


CCATCCCCTA 


CGAGGAGGCG 


GTGGCCTTCA 


TTGAGCGCTA 


CTTCCAGAGC 


2100 




TTCCCCAAGG 


TGCGGGCCTG 


GATTGAGAAG 


ACCCTGGAGG 


AGGGCAGGAG 


GCGGGGGTAC 


2160 




GTGGAGACCC 


TCTTCGGCCG 


CCGGCGCTAC 


GTGCCCGACC 


TCAACGCCCG 


GGTGAAGAGC 


2220 




GTGCGGGAGG 


CGGCGGAGCG 




I±t\\^l\ J. V_ V*- Vj 




CC1CCCZCCC1AC 


2280 




CTCATGAAGC 


TGGCCATGGT 


GAAGCTCTTC 


CCCCGGCTNC 


AGGAAATGGG 


GGCCAGGATG 


2340 




CTCCTNCAGG 


TCCACGACGA 


GCTGGTCCTC 


GAGGCCCCCA 


AAGAGCGGGC 


GGAGGNGGTG 


2400 




GCCGCTTTGG 


CCAAGGAGGT 


CATGGAGGGG 


GTCTATCCCC 


TGGCCGTGCC 


CCTGGAGGTG 


2460 




GAGGTGGGGA 


TGGGGGAGGA 


CTGGCTCTCC 


GCCAAGGAGT 


AG 




2502 
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INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Xaa Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Xaa Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg lie Ala Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Glu Gin Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn 
180 185 ~ 190 

Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Xaa Lys Leu Leu 
195 200 205 

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val 
210 215 220 

Lys Pro Xaa Xaa Arg Glu Lys lie Xaa Ala His Met Glu Asp Leu Xaa 
225 230 235 240 

Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 
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Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa 
325 330 ~ 335 

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 ~ 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu 
405 410 415 

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 "* 540 

Tyr lie Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Xaa Leu Val Ala 
595 600 605 
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ru 



Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 ~ 615 ' 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 * 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 ~ ' 730 735 



s ] Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 

2l ~ 740 745 750 

W Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 

fU 755 760 765 

«¥i 

^ r Phe Pro Arg Leu Xaa Glu Met Gly Ala Arg Met Leu Leu Gin Val His 

5 770 775 780 

0 

R| Asp Glu Leu Val Leu Glu Ala Pro Lys Xaa Arg Ala Glu Xaa Val Ala 

785 790 795 800 



Ala Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Xaa Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 * 825 830 

Xaa 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60 
CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120 
CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180 
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# 





GACGCGGTGA 


TCGTGGTCTT 


TGACGCCAAG 


GCCCCCTCCT 


TCCGCCACGA 


GGCCTACGGG 


240 




GGGTACAAGG 


CGGGCCGGGC 


CCCCACGCCG 


GAGGACTTTC 


CCCGGCAACT 


CGCCCTCATC 


300 




AAGGAGCTGG 


TGGACCTCCT 


GGGGCTGGCG 


CGCCTCGAGG 


TCCCGGGCTA 


CGAGGCGGAC 


360 




GACGTCCTGG 


CCAGCCTGGC 


CAAGAAGGCG 


GAAAAGGAGG 


GCTACGAGGT 


CCGCATCCTC 


420 




ACCGCCGACA 


. AAGACCTTTA 


CCAGCTCCTT 


TCCGACCGCA 


TCCACGTCCT 


CCACCCCGAG 


480 




GGGTACCTCA 


TCACCCCGGC 


CTGGCTTTGG 


GAAAAGTACG 


GCCTGAGGCC 


CGACCAGTGG 


540 




GCCGACTACC 


GGGCCCTGAC 


CGGGGACGAG 


TCCGACAACC 


TTCCCGGGGT 


CAAGGGCATC 


600 




GGGGAGAAGA 


CGGCGAGGAA 


GCTTCTGGAG 


GAGTGGGGGA 


GCCTGGAAGC 


CCTCCTCAAG 


660 




AACCTGGACC 


GGCTGAAGCC 


CGCCATCCGG 


G AG AAGATC C 


TGGCCCACAT 


GGACGATCTG 


720 




AAGCTCTCCT 


GGGACCTGGC 


CAAGGTGCGC 


ACCGACCTGC 


CCCTGGAGGT 


GGACTTCGCC 


780 




AAAAGGCGGG 


AGCCCGACCG 


GGAGAGGCTT 


AGGGCCTTTC 


TGGAGAGGCT 


TGAGTTTGGC 


840 


ion 


AGCCTCCTCC 


ACGAGTTCGG 


CCTTCTGGAA 


AGCCCCAAGG 


CCCTGGAGGA 


GGCCCCCTGG 


900 


Q 


CCCCCGCCGG 


AAGGGGCCTT 


CGTGGGCTTT 


GTGCTTTCCC 


GCAAGGAGCC 


CATGTGGGCC 


960 


i.i 


GATCTTCTGG 


CCCTGGCCGC 


CGCCAGGGGG 


GGCCGGGTCC 


ACCGGGCCCC 


CGAGCCTTAT 


1020 


ry 


AAAGCCCTCA 


GGGACCTGAA 


GGAGGCGCGG 


GGGCTTCTCG 


CCAAAGACCT 


GAGCGTTCTG 


1080 




GCCCTGAGGG 


AAGGCCTTGG 


CCTCCCGCCC 


GGCGACGACC 


CCATGCTCCT 


CGCCTACCTC 


1140 


□ 


CTGGACCCTT 


CCAACACCAC 


CCCCGAGGGG 


GTGGCCCGGC 


GCTACGGCGG 


GGAGTGGACG 


1200 




GAGGAGGCGG 


GGGAGCGGGC 


CGCCCTTTCC 


GAGAGGCTCT 


TCGCCAACCT 


GTGGGGGAGG 


1260 


m 


CTTGAGGGGG 


AGGAGAGGCT 


CCTTTGGCTT 


TACCGGGAGG 


TGGAGAGGCC 


CCTTTCCGCT 


1320 


0 

ill 


GTCCTGGCCC 


ACATGGAGGC 


CACGGGGGTG 


CGCCTGGACG 


TGGCCTATCT 


CAGGGCCTTG 


1380 


TCCCTGGAGG 


TGGCCGGGGA 


GATCGCCCGC 


CTCGAGGCCG 


AGGTCTTCCG 


CCTGGCCGGC 


1440 




CACCCCTTCA 


ACCTCAACTC 


CCGGGACCAG 


CTGGAAAGGG 


TCCTCTTTGA 


CGAGCTAGGG 


1500 




CTTCCCGCCA 


TCGGCAAGAC 


GGAGAAGACC 


GGCAAGCGCT 


CCACCAGCGC 


CGCCGTCCTG 


1560 




GAGGCCCTCC 


GCGAGGCCCA 


CCCCATCGTG 


GAGAAGATCC 


TGCAGGCATG 


CAAGCTTGGC 


1620 




ACTGGCCGTC 


GTTTTACAAC 


GTCGTGA 








1647 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION:. 





ATGAATTCGG 


GGATGCTGCC 


CCTCTTTGAG 




CACCACCTGG 


CCTACCGCAC 


CTTCCACGCC 




CCGGTGCAGG 


CGGTCTACGG 


CTTCGCCAAG 




GACGCGGTGA 


TCGTGGTCTT 


TGACGCCAAG 




GGGTACAAGG 


CGGGCCGGGC 


CCCCACGCCG 




AAGGAGCTGG 


TGGACCTCCT 


GGGGCTGGCG 




GACGTCCTGG 


CCAGCCTGGC 


CAAGAAGGCG 




ACCGCCGACA 


AAGACCTTTA 


CCAGCTCCTT 




GGGTACCTCA 


TCACCCCGGC 


CTGGCTTTGG 


M 


GCCGACTACC 


GGGCCCTGAC 


CGGGGACGAG 


0 
0 
M 


GGGGAGAAGA 


CGGCGAGGAA 


GCTTCTGGAG 


AACCTGGACC 


GGCTGAAGCC 


CGCCATCCGG 




AAGCTCTCCT 


GGGACCTGGC 


CAAGGTGCGC 


W 

w 


AAAAGGCGGG 


AGCCCGACCG 


GGAGAGGCTT 


CO 

s 


AGCCTCCTCC 


ACGAGTTCGG 


CCTTCTGGAA 


Q CCCCCGCCGG 


AAGGGGCCTT 


CGTGGGCTTT 


m 


GATCTTCTGG 


CCCTGGCCGC 


CGCCAGGGGG 


ru 


AAAGCCCTCA 


GGGACCTGAA 


GGAGGCGCGG 


D 

m 


GCCCTGAGGG 


AAGGCCTTGG 


CCTCCCGCCC 




CTGGACCCTT 


CCAACACCAC 


CCCCGAGGGG 




GAGGAGGCGG 


GGGAGCGGGC 


CGCCCTTTCC 




CTTGAGGGGG 


AGGAGAGGCT 


CCTTTGGCTT 




GTCCTGGCCC 


ACATGGAGGC 


CACGGGGGTG 




TCCCTGGAGG 


TGGCCGGGGA 


GATCGCCCGC 




CACCCCTTCA 


ACCTCAACTC 


CCGGGACCAG 




CTTCCCGCCA 


TCGGCAAGAC 


GGAGAAGACC 




GAGGCCCTCC 


GCGAGGCCCA 


CCCCATCGTG 




AAGCTGAAGA 


GCACCTACAT 


TGACCCCTTG 




CTCCACACCC 


GCTTCAACCA 


GACGGCCACG 




AACCTCCAGA 


ACATCCCCGT 


CCGCACCCCG 



2Q ID NO: 10: 

CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60 

CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120 

AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180 

GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240 

GAGGACTTTC CCCGGCAACT CGCCCTCATC 300 

CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360 

GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420 

TCCGACCGCA TCCACGTCCT CCACCCCGAG 4 80 

GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540 

TCCGACAACC TTCCCGGGGT CAAGGGCATC 600 

GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660 

GAGAAGATCC TGGCCCACAT GGACGATCTG 720 

ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780 

AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840 

AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900 

GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960 

GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020 

GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080 

GGCGACGACC CCATGCTCCT CGCCTACCTC 1140 

GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200 

GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260 

TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320 

CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380 

CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440 

CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500 

GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560 

GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620 

CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680 

GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740 

CTTGGG C AG A GGATCCGCCG GGCCTTCATC 1800 
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GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 
GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 
ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 
CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 
CAGGAGCTAG CTAGCCATCC CTTACGAGGA GGCCCAGGCC TTCATTGA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
y h (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

P ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 



\\ CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 

£1 

|Jj CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 

FU GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 



GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 
D AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 



yk GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 

ft! 

ACCGCCGACA AAGACCTTTA CCAGCTTCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 



FU GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 
GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 
AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 
AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 
AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 
AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGT CATGGAGGGG GTGTATCCCC 
TGGCCGTGCC CCTGGAGGTG GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT 
GA 



1860 
1920 
1980 
2040 
2088 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
962 
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# 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 





ATGGAATTCG 


GGGATGCTGC 


CCCTCTTTGA 


GCCCAAGGGC 


CGGGTCCTCC 


TGGTGGACGG 


60 




CCACCACCTG 


GCCTACCGCA 


CCTTCCACGC 


CCTGAAGGGC 


CTCACCACCA 


GCCGGGGGGA 


120 




GCCGGTGCAG 


GCGGTCTACG 


GCTTCGCCAA 


GAGCCTCCTC 


AAGGCCCTCA 


AGGAGGACGG 


180 




GGACGCGGTG 


ATCGTGGTCT 


TTGACGCCAA 


GGCCCCCTCC 


TTCCGCCACG 


AGGCCTACGG 


240 




GGGGTACAAG 


GCGGGCCGGG 


CCCCCACGCC 


GGAGGACTTT 


CCCCGGCAAC 


TCGCCCTCAT 


300 


Q 

o 


CAAGGAGCTG 


GTGGACCTCC 


TGGGGCTGGC 


GCGCCTCGAG 


GTCCCGGGCT 


ACGAGGCGGA 


360 


t » 
""si 


CGACGTCCTG 


GCCAGCCTGG 


CCAAGAAGGC 


GGAAAAGGAG 


GGCTACGAGG 


TCCGCATCCT 


420 


5 s3 

4 s 

Is? 


CACCGCCGAC 


AAAGACCTTT 


ACCAGCTCCT 


TTCCGACCGC 


ATCCACGTCC 


TCCACCCCGA 


480 


ru 


GGGGTACCTC 


ATCACCCCGG 


CCTGGCTTTG 


GGAAAAGTAC 


GGCCTGAGGC 


CCGACCAGTG 


540 


00 

£ 


GGCCGACTAC 


CGGGCCCTGA 


CCGGGGACGA 


GTCCGACAAC 


CTTCCCGGGG 


TCAAGGGCAT 


600 


□ 

m 


CGGGGAGAAG 


ACGGCGAGGA 


AGCTTCTGGA 


GGAGTGGGGG 


AGCCTGGAAG 


CCCTCCTCAA 


660 




GAACCTGGAC CGGCTGAAGC 


CCGCCATCCG 


GGAGAAGATC 


CTGGCCCACA 


TGGACGATCT 


720 


m 
□ 


GAAGCTCTCC 


TGGGACCTGG 


CCAAGGTGCG 


CACCGACCTG 


CCCCTGGAGG 


TGGACTTCGC 


780 


CAAAAGGCGG 


GAGCCCGACC 


GGGAGAGGCT 


TAGGGCCTTT 


CTGGAGAGGC 


TTGAGTTTGG 


840 




CAGCCTCCTC 


CACGAGTTCG 


GCCTTCTGGA 


AAGCCCCAAG 


ATCCGCCGGG 


CCTTCATCGC 


900 




CGAGGAGGGG 


TGGCTATTGG 


TGGCCCTGGA 


CTATAGCCAG 


ATAGAGCTCA 


GGGTGCTGGC 


960 




CCACCTCTCC 


GGCGACGAGA 


ACCTGATCCG 


GGTCTTCCAG GAGGGGCGGG 


ACATCCACAC 


1020 




GGAGACCGCC 


AGCTGGATGT 


TCGGCGTCCC 


CCGGGAGGCC 


GTGGACCCCC 


TGATGCGCCG 


1080 




GGCGGCCAAG 


ACCATCAACT 


TCGGGGTCCT 


CTACGGCATG 


TCGGCCCACC 


GCCTCTCCCA 


1140 




GGAGCTAGCC 


ATCCCTTACG 


AGGAGGCCCA 


GGCCTTCATT 


GAGCGCTACT 


TTCAGAGCTT 


1200 




CCCCAAGGTG 


CGGGCCTGGA 


TTGAGAAGAC 


CCTGGAGGAG 


GGCAGGAGGC 


GGGGGTACGT 


1260 




GGAGACCCTC 


TTCGGCCGCC 


GCCGCTACGT 


GCCAGACCTA 


GAGGCCCGGG 


TGAAGAGCGT 


1320 




GCGGGAGGCG 


GCCGAGCGCA 


TGGCCTTCAA 


CATGCCCGTC 


CGGGGCACCG 


CCGCCGACCT 


1380 




CATGAAGCTG 


GCTATGGTGA 


AGCTCTTCCC 


CAGGCTGGAG 


GAAATGGGGG 


CCAGGATGCT 


1440 




CCTTCAGGTC 


CACGACGAGC 


TGGTCCTCGA 


GGCCCCAAAA 


GAGAGGGCGG 


AGGCCGTGGC 


1500 
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CCGGCTGGCC AAGGAGGTCA TGGAGGGGGT GTATCCCCTG GCCGTGCCCC TGGAGGTGGA 1560 
GGTGGGGATA GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 1600 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CACGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAA 36 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
M» (A) LENGTH: 34 base pairs 

Pi (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



o 



Ui 

ni 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
GTGAGATCTA TCACTCCTTG GCGGAGAGCC AGTC 34 



f=i (2) INFORMATION FOR SEQ ID NO: 15: 

fU (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 
jtl (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Rj i 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TAATACGACT CACTATAGGG AGACCGGAAT TCGAGCTCGC CCGGGCGAGC TCGAATTCCG 60 
TGTATTCTAT AGTGTCACCT AAATCGAATT C 91 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 
TAATACGACT CACTATAGGG 20 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GAATTCGATT TAGGTGACAC TATAGAA 27 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
. s (D) TOPOLOGY: linear 

O (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



J[ GTAATCATGG TCATAGCTGG TAGCTTGCTA C 31 

fy (2) INFORMATION FOR SEQ ID NO: 19: 

ffl 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

W (B) TYPE: nucleic acid 

fU (C) STRANDEDNESS: single 

Ls, (D) TOPOLOGY: linear 

* — 

W (ii) MOLECULE TYPE: DNA (genomic) 

0 

fy (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGATCCTCTA GAGTCGACCT GCAGGCATGC CTACCTTGGT AG 42 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC 30 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60 

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120 

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180 

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240 

Hi GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300 

H 

p AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360 

N GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420 

l s j ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480 

^ GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540 

00 

e GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600 

9 GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660 

ill 

M AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720 

nj 

P AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780 

RI AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840 

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900 

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960 

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020 

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080 

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140 

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200 

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260 

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 13 20 

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 13 80 

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440 

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500 
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CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560 

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620 

AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680 

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740 

AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 1800 

GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 1860 

GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 1920 

ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 198 0 

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 204 0 

CAGGAGCTAG CCATCCCTTA CGAGGAGGCC CAGGCCTTCA TTGAGCGCTA CTTTCAGAGC 2100 

Ul TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG GCGGGGGTAC 2160 

J 

5 GTGGAGACCC TCTTCGGCCG CCGCCGCTAC GTGCCAGACC TAGAGGCCCG GGTGAAGAGC 2220 

P 

S| GTGCGGGAGG CGGCCGAGCG CATGGCCTTC AACATGCCCG TCCGGGGCAC CGCCGCCGAC 2280 
J CTCATGAAGC TGGCTATGGT GAAGCTCTTC CCCAGGCTGG AGGAAATGGG GGCCAGGATG 2340 

W CTCCTTCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCAA AAGAGAGGGC GGAGGCCGTG 2400 

ttl 

GCCCGGCTGG CCAAGGAGGT CATGGAGGGG GTGTATCCCC TGGCCGTGCC CCTGGAGGTG 2460 

S GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT GA 2502 

ill 

U (2) INFORMATION FOR SEQ ID NO: 22: 

R I 

L!f (i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 19 base pairs 

PL1 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GATTTAGGTG ACACTATAG 19 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



- 197 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGGACGAACA AGCGAGACAG CGACACAGGT ACCACATGGT ACAAGAGGCA AGAGAGACGA 60 
CACAGCAGAA AC 72 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTTTCTGCTG TGTCGTCTCT CTTGCCTCTT GTACCATGTG GTACCTGTGT CGCTGTCTCG 60 
CTTGTTCGTC 70 

C) (2) INFORMATION FOR SEQ ID NO: 25: 

SI 

JC; (i) SEQUENCE CHARACTERISTICS: 

.-j (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
fU (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

« (ii) MOLECULE TYPE: DNA (genomic) 

□ 

fiJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

j GACGAACAAG CGAGACAGCG 

(2) INFORMATION FOR SEQ ID NO: 26: 

m 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GTTTCTGCTG TGTCGTCTCT CTTG 24 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



20 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCTCTTGTAC CATGTGGTAC CTGTGTCGCT GTCTCGCTTG TTCGTC 46 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ACACAGGTAC CACATGGTAC AAGAGGCAAG AGAGACGACA CAGCAGAAAC 50 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 
Q (B) TYPE: amino acid 

Q (C) STRANDEDNESS: single 

t . (D) TOPOLOGY: unknown 

HP (ii) MOLECULE TYPE: protein 

u 

}1] (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

^ Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg lie Asn Ser 

^ 1 .5 10 15 

Q 

f%\ (2) INFORMATION FOR SEQ ID NO: 30: 

^* (i) SEQUENCE CHARACTERISTICS: 
[tl (A) LENGTH: 969 base pairs 

Q (B) TYPE: nucleic acid 

py (C) STRANDEDNESS: single 

5 ^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 60 

TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 120 

CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 180 

GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 240 

GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 300 

ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 360 

CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 420 

AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 4 80 

CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 540 
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CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 600 

GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 660 

CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 720 

ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA CCTGGCCAAG 780 

GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 840 

AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 900 

CTGGAAAGCC CCAAGTCATG GAGGGGGTGT ATCCCCTGGC CGTGCCCCTG GAGGTGGAGG 960 

TGGGGATAG 969 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 
H- (b) TYPE: nucleic acid 

0 (C) STRANDEDNESS : single 

p (D) TOPOLOGY: linear 

UJ (ii) MOLECULE TYPE: DNA (genomic) 

4=; 

W (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 





ATGGCTAGCA 


TGACTGGTGG 


ACAGCAAATG 


GGTCGGATCA 


ATTCGGGGAT 


GCTGCCCCTC 


60 


£ 


TTTGAGCCCA 


AGGGCCGGGT 


CCTCCTGGTG 


GACGGCCACC 


ACCTGGCCTA 


CCGCACCTTC 


120 


o 
m 


CACGCCCTGA 


AGGGCCTCAC 


CACCAGCCGG 


GGGGAGCCGG 


TGCAGGCGGT 


CTACGGCTTC 


180 


M 


GCCAAGAGCC 


TCCTCAAGGC 


CCTCAAGGAG 


GACGGGGACG 


CGGTGATCGT 


GGTCTTTGAC 


240 


□ GCCAAGGCCC 


CCTCCTTCCG 


CCACGAGGCC 


TACGGGGGGT 


ACAAGGCGGG 


CCGGGCCCCC 


300 


m 


ACGCCGGAGG 


ACTTTCCCCG 


GCAACTCGCC 


CTCATCAAGG 


AGCTGGTGGA 


CCTCCTGGGG 


360 




CTGGCGCGCC 


TCGAGGTCCC 


GGGCTACGAG 


GCGGACGACG 


TCCTGGCCAG 


CCTGGCCAAG 


420 




AAGGCGGAAA 


AGGAGGGCTA 


CGAGGTCCGC 


ATCCTCACCG 


CCGACAAAGA 


CCTTTACCAG 


480 




CTTCTTTCCG 


ACCGCATCCA 


CGTCCTCCAC 


CCCGAGGGGT 


ACCTCATCAC 


CCCGGCCTGG 


540 




CTTTGGGAAA 


AGTACGGCCT 


GAGGCCCGAC 


CAGTGGGCCG 


ACTACCGGGC 


CCTGACCGGG 


600 




GACGAGTCCG 


ACAACCTTCC 


CGGGGTCAAG 


GGCATCGGGG 


AGAAGACGGC 


GAGGAAGCTT 


660 




CTGGAGGAGT 


GGGGGAGCCT 


GGAAGCCCTC 


CTCAAGAACC 


TGGACCGGCT 


GAAGCCCGCC 


720 




ATCCGGGAGA 


AGATCCTGGC 


CCACATGGAC 


GATCTGAAGC 


TCTCCTGGGA 


CCTGGCCAAG 


780 




GTGCGCACCG 


ACCTGCCCCT 


GGAGGTGGAC 


TTCGCCAAAA 


GGCGGGAGCC 


CGACCGGGAG 


840 




AGGCTTAGGG 


CCTTTCTGGA 


GAGGCTTGAG 


TTTGGCAGCC 


TCCTCCACGA 


GTTCGGCCTT 


900 




CTGGAAAGCC 


CCAAGGCCGC 


ACTCGAGCAC 


CACCACCACC 


ACCACTGA 




948 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 60 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 120 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 180 
TTTCCTGTGT GAAATTGTTA TCCGCT 206 
H (2) INFORMATION FOR SEQ ID NO: 33: 

C) 

Q (i) SEQUENCE CHARACTERISTICS: 
T= (A) LENGTH: 43 base pairs 

.2 (B) TYPE: nucleic acid 

*P (C) STRANDEDNESS: single 

UJ (D) TOPOLOGY: linear 

ft! 

(ii) MOLECULE TYPE: DNA (genomic) 
» (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Q 

pa TTCTGGGTTC TCTGCTCTCT GGTCGCTGTC TCGCTTGTTC GTC 43 
!T. (2) INFORMATION FOR SEQ ID NO: 34: 

py 

P (i) SEQUENCE CHARACTERISTICS: 

ffl (A) LENGTH: 19 base pairs 

iW (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCTGTCTCGC TTGTTCGTC 19 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GACGAACAAG CGAGACAGCG 20 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TTCTGGGTTC TCTGCTCTCT GGTC 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GACGAACAAG CGAGACAGCG ACCAGAGAGC AGAGAACCCA GAA 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACCAGAGAGC AGAGAACCCA GAA 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
AACAGCTATG ACCATGATTA C 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTTCTCTGCT CTCTGGTCGC TGTCTCGCTT GTGAAACAAG CGAGACAGCG TGGTCTCTCG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CGAGAGACCA CGCTG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
AGAAAGGAAG GGAAGAAAGC GAAAGG 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GACGGGGAAA GCCGGCGAAC G 
(2) INFORMATION FOR SEQ ID NO: 45: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
N" (D) TOPOLOGY: linear 

P 

Q (ii) MOLECULE TYPE: DNA (genomic) 

%i 

£, (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

U) GAAAGCCGGC GAACGTGGCG 

ry 

*rg (2) INFORMATION FOR SEQ ID NO: 46: 

" (i) SEQUENCE CHARACTERISTICS: 

P (A) LENGTH: 21 base pairs 

pj (B) TYPE: nucleic acid 

i= (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



rtj 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GGCGAACGTG GCGAGAAAGG A 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GC 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CCTTTCGCTC TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GC 42 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
M (D) TOPOLOGY: linear 

q (ll) MOLECULE TYPE: DNA (genomic) 

^ (ix) FEATURE: 

JP (A) NAME/KEY: modif ied_base 

txl (B) LOCATION: 8 

m (C) IDENTIFICATION METHOD: experimental 

k£ (D) OTHER INFORMATION: /evidence^ EXPERIMENTAL 
/mod_base= OTHER 

^ /note= "The A residue at this position is 2 ' -O- methyl adenosine . " 

test 

fy (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

H AGAAAGGAAG GGAAGAAAGC GAAAGGT 27 

m 

P (2) INFORMATION FOR SEQ ID NO: 50: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCCGGCGAAC GTGGCGAGAA AGGA 24 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGTTTTTCTT TGAGGTTTAG 20 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

GCGACACTCC ACCATAGAT 19 

(2) INFORMATION FOR SEQ ID NO: 53: 

L£ (i) SEQUENCE CHARACTERISTICS: 
~, (A) LENGTH: 19 base pairs 

fc* (B) TYPE: nucleic acid 

O (C) STRANDEDNESS: single 

\\ (D) TOPOLOGY: linear 

jjj (ii) MOLECULE TYPE: DNA (genomic) 

PJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Rf 

b CTGTCTTCAC GCAGAAAGC 19 
O (2) INFORMATION FOR SEQ ID NO: 54: 

ru 

Mi (i) SEQUENCE CHARACTERISTICS: 
hs (A) LENGTH: 19 base pairs 

J*f (B) TYPE: nucleic acid 

fcJ (C) STRANDEDNESS: single 

ftJ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GCACGGTCTA CGAGACCTC 19 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TAATACGACT CACTATAGGG 20 
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(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: RNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GGGAAAGCUU GCAUGCCUGC AGGUCGACUC UAGAGGAUCU ACUAGUCAUA UGGAUUCUGU 60 

CUUCACGCAG AAAGCGUCUG GCCAUGGCGU UAGUAUGAGU GUCGUGCAGC CUCCAGGACC 120 

CCCCCUCCCG GGAGAGGCAU AGUGGUCUGC GGAACCGGUG AGUACACCGG AAUUGCCAGG 180 

ACGACCGGGU CCUUUCUUGG AUAAACCCGC UCAAUGCCUG GAGAUUUGGG CGUGCCCCCG 240 

H CAAGACUGCU AGCCGAGUAG UGUUGGGUCG CGAAAGGCCU UGUGGUACUG CCUGAUAGGG 3 00 

O 

P UGCCUGCGAG UGCCCCGGGA GGUCUCGUAG ACCGUGC 33 7 

J (2) INFORMATION FOR SEQ ID NO: 57: 



Ul (i) SEQUENCE CHARACTERISTICS: 
jlj (A) LENGTH: 20 base pairs 

gi (B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: single 

* (D) TOPOLOGY: linear 

s 

pj (ii) MOLECULE TYPE: DNA (genomic) 

Jj* (ix) FEATURE: 

W (A) NAME/KEY: misc_feature 

Q (B) LOCATION: 18 

Hi (C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL 
/note= "The N at this position indicates the presence of a 
fluorescein dye on an abasic linker." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CCGGTCGTCC TGGCAATNCC 20 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GTTTATCCAA GAAAGGACCC GGTCC 2 5 
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(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CAGGGTGAAG GGAAGAAGAA AGCGAAAGGT 3 0 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
y a (D) TOPOLOGY: linear 

q 

C) 

S| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



(ii) MOLECULE TYPE: DNA (genomic) 



yg CAGGGGGAAG GGAAGAAGAA AGCGAAAGGT 30 

Ri (2) INFORMATION FOR SEQ ID NO: 61: 

m 

B (i) SEQUENCE CHARACTERISTICS: 

fssi (A) LENGTH: 22 base pairs 

Jj{ (B) TYPE: nucleic acid 

Py (C) STRANDEDNESS: single 

U (D) TOPOLOGY: linear 



n 



(ii) MOLECULE TYPE: DNA (genomic) 



FU (ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1. .2 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence^ EXPERIMENTAL 
/mod_base= OTHER 

/note= "The T residues at positions 1 and. 2 are amino modified T 
residues . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TTCTTTTCAC CAGCGAGACG GG 22 
(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATTGGGCGCC AGGGTGGTTT TT 

22 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
CCCGTCTCGC TGGTGAAAAG AAAAACCACC CTGGCGCCCA ATACGCAAAC CGC 53 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 
t = (A) LENGTH: 31 base pairs 

Q (B) TYPE: nucleic acid 

O (C) STRANDEDNESS: single 

%J (D) TOPOLOGY: linear 

+• (ii) MOLECULE TYPE: DNA (genomic) 

yj 

fl| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Pi 

GAATTCGATT TAGGTGACAC TATAGAATAC A 
e 31 

Q (2) INFORMATION FOR SEQ ID NO-65- 

m 

Lb, (1) SEQUENCE CHARACTERISTICS: 
m (A) LENGTH: 42 base pairs 

I* 1 (B) TYPE: nucleic acid 

U (C) STRANDEDNESS: single 

fy (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GC 42 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GCCGGCGAAC GTGGCGAGAA AGGA 

24 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
CAGAAGGAAG GGAAGAAAGC GAAAGG 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CAGGGGGAAG GGAAGAAAGC GAAAGG 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CAGGGTACAG GGAAGAAAGC GAAAGG 
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